A Distributed Platform for Sanskrit Processing

نویسندگان

  • Pawan Goyal
  • Gérard P. Huet
  • Amba P. Kulkarni
  • Peter M. Scharf
  • Ralph Bunker
چکیده

Sanskrit, the classical language of India, presents specific challenges for computational linguistics: exact phonetic transcription in writing that obscures word boundaries, rich morphology and an enormous corpus, among others. Recent international cooperation has developed innovative solutions to these problems and significant resources for linguistic research. Solutions include efficient segmenting and tagging algorithms and dependency parsers based on constraint programming. The integration of lexical resources, text archives and linguistic software is achieved by distributed interoperable Web services. Resources include a morphological tagger and tagged corpus.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Collaborative Platform for Sanskrit Processing

Sanskrit, the classical language of India, presents specific challenges for computational linguistics: exact phonetic transcription in writing that obscures word boundaries, rich morphology and an enormous corpus, among others. Recent international cooperation has developed innovative solutions to these problems and significant resources for linguistic research. Solutions include efficient segm...

متن کامل

Verbal Roots in the Sanskrit Wordnet

Wordnets (WN) are accepted worldwide as useful lexical tools for Natural Language Processing (NLP) . Projects for building WNs of different languages of the world are going for quite some time. The scenario for Indian Languages is also encouraging. Indian Institute of Technology Bombay (IITB) has successfully created WNs for Hindi and Marathi.There have been more than 100,000 hits of the sites ...

متن کامل

Completeness Analysis of a Sanskrit Reader

We analyse in this paper differences of linguistic treatment of Sanskrit in the Sanskrit Heritage platform and in the Paninian gram-

متن کامل

Fuzzy Modeling and Natural Language Processing for Panini's Sanskrit Grammar

Indian languages have long history in World Natural languages. Panini was the first to define Grammar for Sanskrit language with about 4000 rules in fifth century. These rules contain uncertainty information. It is not possible to Computer processing of Sanskrit language with uncertain information. In this paper, fuzzy logic and fuzzy reasoning are proposed to deal to eliminate uncertain inform...

متن کامل

Sanskrit as a Programming Language and Natural Language Processing

In this paper represents the work toward developing a dependency parser for Sanskrit language and also represents the efforts in developing a NLU(Natural Language Understanding) and NLP(Natural Language Processing) systems. Here, we use ashtadhayayi (a book of Sanskrit grammar) to implement this idea. We use this concept because the Sanskrit is an unambiguous language. In this paper, we are pre...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012